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—i 1 Abstract 

o : 

_ In the store- and-forward routing problem, packets have to be routed along given paths such 

that the arrival time of the latest packet is minimized. A groundbreaking result of Leighton, Maggs 
and Rao says that this can always be done in time 0(congestion + dilation), where the congestion 
is the maximum number of paths using an edge and the dilation is the maximum length of a 
' path. However, the analysis is quite arcane and complicated and works by iteratively improving 

an infeasible schedule. Here, we provide a more accessible analysis which is based on conditional 
ry^ \ expectations. Like ILMR94I , our easier analysis also guarantees that constant size edge buffers 

■ suffice. 

Moreover, it was an open problem stated e.g. by Wiese IWiell |, whether there is any instance 
where all schedules need at least {! + £)■ (congestion + dilation) steps, for a constant £ > 0. We 
'— 1 ' answer this question affirmatively by making use of a probabilistic construction. 

(N : 

oo ! 1 Introduction 

7— i ; 

One of the fundamental problems in parallel and distributed systems is to transport packets within 
a communication network in a timely manner. Any routing protocol has to make two kinds of de- 
cisions: (1) on which paths shall the packets be sent and (2) according to which priority rule should 
. packets be routed along those paths, considering that communication links have usually a limited 

bandwidth. In this paper, we focus on the second part of the decision process. More concretely, 
J> ' we assume that a network in form of a directed graph G = {V,E) is given, together with source sink 

pairs Si, tj e V for i = 1, ...,k and s;-f ; - paths Pj Q E. So the goal is to route the packets from their 
source along the given path to their sink in such a way that the makespan is minimized. Here, the 
makespan denotes the time when the last packet arrives at its destination. Moreover, we assume unit 
bandwidth and unit transit time, i.e. in each time unit only one packet can traverse an edge and the 
traversal takes exactly one time unit. Since the only freedom for the scheduler lies in the decision 
when packets move and when they wait, this setting is usually called store and forward routing. Note 
that we make no assumption about the structure of the graph or the paths. In fact, we can allow that 
the graph has multi-edges and loops; a path may even revisit the same node several times. We only 
forbid that a path uses the same edge more than once. 

Two natural parameters of the instance are the congestion C := max e£ £ \{i | e e Pi\\, i.e. the maxi- 
mum number of paths that share a common edge and the dilation D := max, =1 ^ \Pf\, i.e. the length 
of the longest path. Obviously, for any instance, both parameters C and D are lower bounds on the 
makespan for any possible routing policy. Surprisingly, Leighton, Maggs and Rao |LMR94| could 
prove that the optimum achievable makespan is always within a constant factor of C + D. Since then, 
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Figure 1: Example instance with k = 3, C = 2 and D = 4 



their approach has been revisited several times. First, |LMR99| provided a polynomial time algorithm 
that makes the approach constructive (which nowadays would be easy using the Moser Tardos algo- 
rithm |MT10|). Scheideler |Sch98, Chapter 6] provides a more careful (and more accessible) analysis 
which reduces the hidden constants to 39 (C + D). More recently Peis and Wiese |PW11 1 reduced the 
constant to 24 (and beyond, for larger minimum bandwidth or transit time). 

Already the original paper of |LMR94| also showed that (huge) constant size edge buffers are suf- 
ficient. Scheideler |Sch98| proved that even a buffer size of 2 is enough. However, all proofs ILMR94, 
LMR99 , Sch98] IPW11I use the original idea of Leighton, Maggs and Rao to start with an infeasible 
schedule and insert iteratively random delays to reduce the infeasibility until no more than 0(1) pack- 
ets use an edge per time step (in each iteration, applying the Lovasz Local Lemma). 

In this paper, we suggest a somewhat dual approach in which we start with a probabilistic sched- 
ule which is feasible in expectation and then reduce step by step the randomness (still making use 
of the Local Lemma) . Our construction here is not fundamentally different from the original work of 
|LMR94|, but the emerging proof is "less iterative" and, in the opinion of the author, also more clear 
and explicit in demonstrating to the reader why a constant factor suffices. Especially obtaining the 
additional property of constant size edge buffers is fairly simple in our construction. 

If it comes to lower bounds for general routing strategies, the following instance is essentially the 
worst known one: C many packets share the same path of length D. Then it takes C time units until 
the last packet crosses the first edge; that packet needs D — 1 more time units to reach its destina- 
tion, leading to a makespan of C + D - 1. Wiese IWiell I) states that no example is known where the 
optimum makespan needs to be even a small constant factor larger. We answer the open question 
in IWiell l and show that for a universal constant e > 0, there is a family of instances in which every 
routing policy needs at least (1 + e) ■ (C + D) time units (and C,D -* ooo In our chosen instance, we 
generate paths from random permutations and use probabilistic arguments for the analysis. 

1.1 Related Work 

The result of ILM R94I I LMR 99I could be interpreted as a constant factor approximation algorithm 
for the problem of finding the minimum makespan. In contrast, finding the optimum schedule is 
NP-hard |CI96|. In fact, even on trees, the problem remains APX-hard |PSW09|. If we generalize 
the problem to finding paths plus schedules, then constant factor approximation algorithms are still 
possible due to Srinivasan and Teo | STOO | (using the fact that it suffices to find paths that minimize 
the sum of congestion and dilation) . Koch et al. | KPSW09 1 extend this to a more general setting, where 
messages consisting of several packets have to be sent. 



The constant can be chosen e.g. as e := 0.00001, though we do not make any attempt to optimize the constant, but 
focus on a simple exposition. 



The Leighton-Maggs-Rao result, apart from being quite involved, has the disadvantage of being 
a non-local offline algorithm. In contrast, there is a distributed algorithm with makespan 0(C) + 
(log* n) 0(1 °g* n) D + log° (1) n by Rabani and Tardos IRT96I which was later improved to 0(C + D + 
log 1+£ n) by Ostrovsky and Rabani |OR97|. If the paths are indeed shortest paths, then there is a ran- 
domized online routing policy which finishes in 0(C + D + logfc) steps |MV99|. To the best of our 
knowledge, the question concerning the existence of an 0(C + D) online algorithm is still open. We 
refer to the book of Scheideler |Sch98| for a more detailed overview about routing policies. 

One can also reinterpret the packet routing problem as (acyclic) job shop scheduling J \ pij — 
1, acyclic | C max , where jobs / and machines M are given. Each job has a sequence of machines that 
it needs to be processed on in a given order (each machine appears at most once in this sequence), 
while all processing times have unit length. For the natural generalization (/ | ptj, acyclic | C max ) with 
arbitrary processing times ptj, Feige & Scheideler |FS02| showed that schedules of length 0{L- logL- 

log log L) are always possible and for some instances, every schedule needs at least Q(L • , lo , gI T ) time 
units, where we abbreviate L := max{C,D}|f| Svensson and Mastrolilli IMS11I showed that this lower 
bound even holds in the special case of flow shop scheduling, where all jobs need to be processed on 
all machines in the same order (in packet routing, this corresponds to the case that all paths P,- are 
identical). In fact, for flow shop scheduling with jumps (i.e. each job needs to be processed on a given 
subset of machines) it is even NP-hard to approximate the optimum makespan within any constant 
factor IMS11I . 

In contrast, if we allow preemption, then even for acyclic job shop scheduling, the makespan can 
be reduced to 0(C + D log log max, j pij) |FS02| and it is conceivable that even 0(C + D) might suffice. 

1.2 Organisation 

In Section|2l we recall some probabilistic tools. Then in Section|3]we show the existence of an 0(C+D) 
routing policy, which is modified in Section 0] to guarantee that constant size edge buffers suffice. 
Finally, we show the lower bound in Section [5l 



2 Preliminaries 

Later, we will need the following concentration result, which is a version of the Chernou-Hoeffding 
bound: 

Lemmal (|DP09, Theorem 1.1]). LetZi,. .., e [0,5] be independently distributed random variables 
with sum Z := ELi %i and let p > E[Z] . Then for anye > 0, 

Pr[Z>(l + e)^] <exp(-y -|). 

Moreover, we need the Lovdsz Local Lemma (see also the books |AS08| and |MU05| and for the 
constructive version, see | MT10|). 

Lemma 2 (Lovasz Local Lemma IEL75I ). Let A\,...,A m be arbitrary events such that (1) Pr[A ; ] < p; (2) 
each At depends on at most d many other events; and (3) 4- p- d < 1. Then Pr [dTLi A'] > 0- 

2 In this setting, one extends C = max £ pi j and D = max ^ pij. 

l£M j £ J' j uses i ■' e ^ i^M: j uses i 
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Figure 2: Path P; and its dissection with L = 2. Denote the random waiting time of the jth block in 
level £ by agj. Then the packet would wait for [W\ - a n ) + «i2 + (W2 ~ a 2s) + «24 time units in node u. 



3 O (congestion + dilation) routing 

After adding dummy paths and edges, we may assume that C = D and every path has length exactly 
D. In the following we show how to route the packets within 0(D) time units such that in each time 
step, each edge is traversed by at most 0(1) many packets (by stretching the time by another 0(1) 
factor, one can obtain a schedule with makespan 0(D) in which each edge is indeed only traversed 
by a single packet). In the following, we call the largest number of packets that traverse the same edge 
in one time unit the load of the schedule. 

Let A > be a constant that we leave undetermined for now - at several places we will simply 
assume A to be large enough for our purpose. Consider a packet i and partition its path P ; - into a 
laminar family of blocks such that the blocks on level £ contain D( = D (1 /2) many consecutive edgesH 
We stop this dissection, when the last block (whose index we denote by L) has length between A and 
A 2 . 

In other words, the root block (i.e. the path Pj itself) is on level and the depth of that laminar 
family is L = ©(loglogD) (though this quantity will be irrelevant for the analysis). Each block has 2 
boundary nodes, a start node and an end node. Observe that a level £ block of length D( has children 
of length Dg + i = \f~D~(. Moreover, we define 



Wf.= 




= 
> 1 



The routing policy for packet i is now as follows: For each level £ block, the packet waits a uniformly 
and independently chosen random time x e [1, W(] at the start node^; furthermore the packet waits 
Wf-x time units at the end node (see Figure[2). This policy has two crucial properties: 

(A) The total waiting time of each packet is 0(D) . 

(B) The time t at which packet i crosses an edge e e P,- is a random variable that depends only on 
the random waiting times of the blocks that contain e — in fact, i.e. only one block from each 
level. 



3 Depending on D, the quantity D( may not be integral. But all our calculations have enough slack so that one could 
replace D( with the nearest power of 2. Then we may also assume that for each (, Dg divides D(_i . 
4 We define [a, b] := {a, a + 1, a + 2, . . . , b} as the set of integers between a and b. 
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Let us argue, why {A) is true. The waiting time on level £ = will be precisely D, while for each £ > 1 
the total level-awaiting time for each packet will be ^-W^ = Using the crude bound D( > 4-D^ +1 

we have Dl- ; > 4-', hence on level L- j >0, the total waiting time will be at most -§4- < §t. Thus the 

L-j Z 

total waiting time for a packet, summed over all levels is at most D + DYZ-Zq^)^ = 0{D). In other 
words: each packet is guaranteed to arrive after at most T := 0(D) time units. Note that there are 
instances where the vast majority of random outcomes would yield a superconstant load on some 
edge. However, one can prove that there exists a choice of the waiting times such that the load does 
not exceed 0(1). 

Let X{e, t, i) e {0, 1} be the random variable that tells us whether packet i is crossing edge e at time 
t. Moreover, let X{e, t) = Zf =1 X{e, t, i) be the number of packets crossing e at time t. Since packet i 
waits a random time from [1,D] in Sj, we have Pr[X(e, t, i)] < A for each e, i, t (more formally: no mat- 
ter how the waiting times on level > 1 are chosen, there is always at most one out of D outcomes for 
the level waiting time that cause packet i to cross e precisely at time t). Since no edge is contained 
in more than D paths, we have \E[X{e, f)] < 1. 

In the following, if a e [1, W^] DID( is a vector of level ^-waiting times, then \E[X{e, t) \ a] denotes 
the corresponding conditional expectation, depending on a. The idea for the analysis is to fix the 
waiting times on one level at a time (starting with level 0) such that the conditional expectation 
E[X(e, t)] never increases to a value larger than, say 2. Before we continue, we want to be clear about 
the behaviour of such conditional random variables. 

Lemma 3. Let £ e {0, . . . , L - 1} and condition on arbitrary waiting times for level 0,...,£. Then for any 
packet i, edge eeE and any time t e [T] one has 

a) mX{e,t,i)]<^. 

b) If the event X(e, t, i) has non-zero probability then Pr[X(<?, t, i)] > -r-k- . 

w e+i 

Proof. For [a), suppose also all waiting times except of the level £ +1 block in which i crosses e are 
fixed adversarially. Still, there is at most one out of W^ +1 outcomes that cause packet i to cross e at 
time t. 

For {b), observe that the time at which packet 2 crosses e depends only on the waiting time of the 
blocks that contain e (i.e. one block per level). The number of possible outcomes of those waiting 
times is bounded by iT^o" 1 We+i+j ^ iW e+1 ) L J^ a,2)J = W^ +V □ 

The whole analysis boils down to the following lemma, in which we prove that we can always fix 
the waiting times on level £ without increasing the expected load on any edge by more than D^ 1/32 . 
What happens formally is that we show the existence of a sequence ao, . . . , ocl-i such that <xe denotes 
a vector of level ^-waiting times and 

1 

E[X{e,t)\a ,...,ae-i,a e ]<E[Xie,t)\a ,...,a e -i] + — n ^ \/eeE\/t£[T] (1) 

(given that the right hand side is at least 1). To do this, suppose we already found and fixed proper 
waiting times do,..., ff^-i. Then one can interpret the left hand side of [D as a random variable de- 
pending ona/, which is the sum of independently distributed values — and hence well concentrated. 
Moreover the dependence degree of this random variable is bounded by a polynomial in . Thus the 
Lovasz Local Lemma provides the existence of suitable waiting times ag. 
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Lemma 4. Let £ E {0,...,L— 1} and suppose that we already fixed all waiting times on level 0,...,£-l. 
LetX[e, t) be the corresponding conditional random variable and assume y > max e ^E,K[T] {E[X{e, t)]} 
and 1 < y < 2. Then there are level £ waiting times a such that 

UX{e,t)\a]<y+-^ m Vee£V?e[T] 

Proof. We abbreviate m:=Dg. First recall that on level £, (1) blocks have length m; (2) the child blocks 
have length \fm and (3) the waiting time on the next level £ + 1 is from [1, m 1/8 ]. 

We define Y{e, t) := E[X{e, t) | a] and consider Y{e, t) as a random variable only depending on 
a. Since the waiting times on levels 0, . . . , £ - 1 are already fixed, we know exactly the level ^-block in 
which packet i will cross edge e — let a,- i£ > be the random waiting time for that block. Then we can 
write 

k 

Y{e, f) = £ Pr[X(e, t, i) | a i>e ] (2) 
i=i 

By Lemma[3j(b), we know that Pr[X(e, t, i) | a,-, e ] < —^r for every choice of a,-, e . Thus Y{e, t) is the 
sum of independent random variables in the interval [0, m" 1/8 ] and the Chernov bound (Lemma[T) 
provides 

* [ ne, f ) > r + -L,] , exp ( - i ■ ■ m ™) l e~^ 

Now we want to apply the Lovasz Local Lemma for the events " Y {e, t ) > y + m~ 1 132 " to argue that it is 
possible that none of the events happens. So it suffices to bound the dependence degree by a poly- 
nomial in m. Lemma|3l(b) guarantees that if the event X{e, t, i) is possible at all, then Pr[X(e, t, i)] > 
■^2 > Now, reconsider Equation (2} and let Q[e, t) := {i e [k] | Pr[X(e, t, i) > 0} be the set of packets 

that still have a non-zero chance to cross edge e at time t. Taking expectations of Equation GJ, we see 
that 

2> r >E[F(e,Z:)]= £ Vr[X{e,t,i)]>\Q{e,t)\- — 

ieQ(e.t) m 

and hence \Q{e, t) \ < 2m. This means that each random variable Y{e, t) depends on at most 2m en- 
tries of a. Moreover, consider an entry in a, say it belongs to packet i and block B. This random 
variable appears in the definition of Y[e, t) if e £ B and t belongs to B's time frame - these are just 
m ■ 0{m) many combinations. Here we use that the time difference between entering a level £ block 
and leaving it, is bounded by 0{D(). Overall, the dependence degree is 0(m 3 ). Since the probability 
of each bad event "Y{e, t) > j + m~ 1/32 " is superpolynomially small, the claim follows by the Lovasz 
Local Lemma and the assumption that m > A is large enough. □ 

We apply this lemma for £ = 0, . . . , L - 1 and the maximum load after any iteration will be bounded 
by (£V)" 1/32 < 2 for A large enough. The finally obtained random variables X(e, t, i) are almost 

deterministic — just the waiting times on level L are still probabilistic. But again by Lemma[3j all non- 
zero probabilities Pr[X(e, t, i)] are at least T^im = O(l), thus making an arbitrary choice for them 
cannot increase the load by more than a constant factor. Finally, we end up with a schedule with load 
0(1). 



4 Providing constant size edge buffers 

Now let us imagine that each directed edge (u, v)eE has an edge buffer at the beginning of the edge. 
Whenever a packet arrives at node u and has e as next edge on its path, the packet waits in e's edge 
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Figure 3: Visualisation of the waiting time for a single level /-block. Black edges are assigned to the 
considered block and are labelled with the waiting time. 



buffer. But a packet i is still allowed to wait an arbitrary amount of time in s ; - or t{. 

In the construction that we saw above, it may happen that many packets wait for a long time 
in one node, i.e. a large edge buffer might be needed. However, as was shown by Leighton, Maggs 
and Rao |LMR94|, one can find a schedule such that edge buffers of size 0(1) suffice. More precisely, 
|LMR94| found a schedule with load 0(1) in which each packet waits at most one time unit in every 
node — after stretching, this results in a schedule with load 1 and 0(1) buffer size. 

In fact, we can modify the construction from Section [3] in such a way that we spread the waiting 
time over several edges and obtain the same properly. Consider the dissection from the last section. 
Iteratively, for £ = 1, . . . , L, shift the level ^-blocks such that every level £ -I boundary node lies in the 
middle of some level ^-block, see Figure H] (note that we assume that D(-\ is an integral multiple of 
D(). Fix a packet i and denote the edges of its path by P; = [e\,..., e^), then we assign all edges ej 
whose index j is of the form (1 + 2Z) • 2 q to level L - q (for q e {0, . . . ,L - 1}). For example, this means 
that all odd edges are assigned to the last level; the top level does not get assigned any edges. 

Now we again define random waiting times for packet i and a block B: on level £ > 1, each block 
picks a uniform random number x e [1, Wg]. The packet waits on each of the first x edges that are 
assigned to the block. Moreover, it waits on each of the last Wg - x edges that are assigned to the 
block. Observe that regardless of the random outcome, the packet will wait at most once per edge 
since edges are assigned to at most one level. Using the convenient bound 2 L ~ e < D l J 8 for A large 
enough, we see that all level-^ randomization takes place within the first and last D^ /8 edges of each 
block, see FigureEJ 

The top block does not get assigned any edge, so instead for each packet i, we pick a value x e 
[1,D] at random and wait x time units in stu 

Reinspecting LemmaHJ we observe that LemmaHJ b) holds without any alterations and Lemma[3j a) 
holds as long as the considered edge e has a minimum distance of from the nearest level £ + 1 
boundary node. Surprisingly, also Lemma |4] still holds with a minor modification in the claimed 
bound. 

Lemma 5. Let £ e {0, . . . , L - 2} and suppose that we already fixed all waiting times on level 0, . . . , £ - 1 . 
LetX{e, t) be the corresponding conditional random variables and assume j > max ee E,te[T]{^lX[e, t)]} 
and 1 < y < 2. Then there are level £ waiting times a such that 

E[X{e,t)\a]<y+—^ m \/eeEVte[T] 

5 If for a block, due to the shifting, some or all waiting edges are shifted "before" the source S[ , then the packet just waits 
the missing time in Sj . 
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Figure 4: Shifted dissection (with m := D^). Regions in which randomization takes place are depicted 
in darkgray (observe these regions do not overlap for consecutive levels) . 



Proof. Again abbreviate m := D( and consider 

k 

Y(e, t) := E[X(e, t) \ a] = £ Pr[X(e, r, I 

(=i 

as a random variable only depending on a (recall that a;> is the random waiting time for that level 
^-block in which packet i crosses edge e). For a fixed edge e, for one of those levels £' e {£ + 1,£ + 2}, 
the edge e is at least | m 1/4 edges away from the next level £' boundary node (see FigureH). Consider 
the level £' -block B that contains e. As already argued, all randomization takes place on the first and 
last D 3 ' 8 < -D^® = m 3/16 « |m 1/4 edges (for m > A large enough). So we can still apply Lemma[3ja) 
for level £' to obtain Pr[X(e, t, i) \ a.i :e ] < < -^m- Again by the Chernov bound (i.e. Lemma[I]with 
5 := -Trie , £ ■= 77^764 . M := T - 1) we have 
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Next, note that still Pr[X(e, t, i) \ a; e ] > given that this probability is positive. Thus from now on 
we can follow the arguments in the proof of Lemma|4] The dependence degree is still bounded by 
0(m 3 ), thus the claim follows by the Lovasz Local Lemma since 4 • 0(m 3 ) ■ e~ m 112 < 1 for m > A 
large enough. □ 

Again, we have initially E[X(e, t)] < 1 for all e and t, then we fix the waiting times iteratively on 
level 0, . . ., L- 2 using Lemma[5]and make an arbitrary choice for the waiting times of level L - 1 and 
level L. This results in a schedule of length 0(D) and load 0(1), in which packets wait at most one 
time unit before entering an edge. 



5 A (1 + £)• (C + D) lower bound 

In this section, we prove that there is an instance in which the optimum makespan must be at least 
(1 + £) • (C + D), where £ > is a small constant. The graph G = {V,E) is defined as depicted in Figure[5] 
(the formal definition follows from the definition of the paths, which we will see in a second). Edges 
gj = {Ui,Vi) are called critical edges, while we term {Vi,Uj) back edges. We want to choose paths 
P\,...,P n as random paths though the network, all starting at si := 5 and ending at f ; - := t. More 
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Figure 5: Graph G = {V,E). The bold edges depict the path P, originating from the identity permuta- 
tion iii = {1,2,..., n). 



concretely, each packet i picks a uniform random permutation 7T, : [n] — [n] which gives the order in 
which it moves through the critical edges e\,...,e n .\n other words, 

Pi = {S,s', U ni [X), V ni [X), U Mi (2), Vj ( (2),...„I%,(b), v Ki{ri), Un+l, t). 

Then the congestion is n and the dilation is 2n + 3. We consider the time frame [1, T] with T = (3 + e) n 
and claim that for e > small enough, there will be no valid routing that is finished by time T. 

Theorem 6. Pick paths Pi,...,P n at random. Then with probability 1 - e~ n( " Z \ there is no packet 
routing policy with makespan at most 3. 000032n (even if buffers of unlimited size are used). 

First of all, clearly the makespan must be at least C + D-l ~ 3n since all paths have the same 
length D and all packets must first cross edge {s, s') . So if we allow only time (3 + e) n, then there is only 
a small slack of en time units. One can show that the number of different possible routing strategies 
is bounded by 2 o( " ' (for e — ► 0). In contrast, we can argue that a fixed routing will fail against random 
paths with probability 2~ n( " Then choosing e small enough, the theorem follows using the union 
bound over all routing strategies. 

We call a packet i active at time t if it is traversing an edge. We say a packet is parking at time t if 
it is either in the end node f; nor in the start node s/. We say a packet is waiting if it is neither active 
nor parking. 

5. 1 The number of potential routing strategies 

Consider a fixed packet i and let us discuss, how a routing strategy is defined. The only decision that 
is made, is of the form: "How many time units shall the packet wait in the k-th node on its path (for 
k = 0, ...,D)". It is not necessary to wait in s' since a packet could instead move to w^d) an d wait 
there. Moreover, it is not needed to wait in one of the nodes Vj, since instead it could also wait in the 
next uy node on its way (the reason is that if there would be a collision on a back edge ( v JTj , u JTj y+u ) 
with packet i' ■£ i, then this packet i' has crossed the critical edge {u Mj (j), v ni (j)) together with i in the 
previous time step, so there was already a collision). In other words, the complete routing strategy for 
packet i can be described as a (n +2) -dimensional vector Wj £ Z"^ 2 , where W;y is the time that packet 
i stays in node uj (for convenience, we denote s also as uq). Then L"=j Wij is the total waiting time 
and for i e [n] and W{q is the time that i parks in the start node. 
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Independently from the outcome of the random experiment, we know the time when each packet 
crosses the edges incident to s and to t. We call W a candidate routing strategy, if there is no collision 
on (5, s') and [u n+ \, t) and the makespan of each packet is bounded by (3 + e)n. 

Recall that H{8) = 8 log i + (1 - 6) log j^g is the binary entropy function^. Then we have: 

Lemma 7. The total number of candidate routing matrices W is at most2 ( * (£)+otl)) '" 2 , where := 
tf(lfi) •(! + £)• 

Proof. First of all, the parking times in s and the total waiting time T."=l ^ij for a packet i are between 

and (1 + e)n < 2n; thus there are at most {2n) 2n = 2 o( " 2) many possibilities to choose them. 

Thus assume that the total waiting time e\ n - Y."=l Wfy for packet i is fixed. Then the number of 
possibilities how this waiting time can be distributed among nodes u\,..., u n+ i is bounded by 



'(»+« + (£//!)- 1 



where we use the bound ( 5 ^) < 2 H(8)m with m= (1 + e,)n and 6 = yf^. 

Next, let us upperbound the total waiting time Of course, the waiting time must fit into 

the time frame of length T = (3 + e) n. Since edge (5, s') can only be crossed by one packet at a time, 
the cumulated time that the packets spend in the start node is at least T."Zo T ~ " , The same 

amount of time is spent by all packets in the end node. Moreover, the packets spend at least 2n 2 time 
units traversing edges. We conclude that 

" n 2 {l-o{\)) n 2 (l-o(l)) 2 7 
nJ^ei^nT 2n 2 = (£ + o(l))n 2 , 

i=l Z Z 

thus£" =1 f, < [£ + o{l))n. Once the values £\,... ,e n are fixed, the total number of routing policies for 
the n packets is hence upperbounded by 

2 <s ' [£i)n = 2"^i"=i*fe0 < 2 " 2 a>(^i:" =1 i;i) < 2 n 2 (i>(£-)+o(i)) 

(=1 

Here we use Jensen's inequality together with the fact that $ is concave. The claim follows. □ 



The important property of function <S apart from concavity is that lim £ ^o <!>(£) = 0. Note that for 

10' 



< £ < one can conveniently upperbound <S(£) < 2 1 - 5£lo s^'" 




Figure 6: Function $. 



'Here log is the binary logarithm. 
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5.2 A fixed strategy vs. random paths 



Now consider a fixed candidate routing matrix W and imagine that the paths are taken at random. 
We will show that this particular routing matrix W is not legal with probability 1 - e n(n ' . For this sake, 
we observe that there must be Q(n) time units in which at least a constant fraction of packets cross 
critical edges. For each such time unit the probability of having no collision is at most (i) n(n) and the 
claim follows. The only technical difficulty lies in the fact that the outcomes of values ni ( j) and m ( /) 
for the random permutations are (mildly) dependent. 

Lemma 8. Supposes < ^. Let W be a candidate routing matrix. Then take paths Pi,..., P n at random. 
The probability that the routing scheme defined by W is collision-free is at most ( j|)" 2/128 . 

Proof. For time t, let /5 T nbe the number of packets that cross one of the critical edges at time t, thus 
T.j=i Pt = n (note that the )S T 's do not depend on the random experiment). Let p := Pr T£ [r] IPt ^ \] De 
the fraction of time units in which at least j packets are crossing a critical edge. Then 



3 + e 



3r 1 
-= E [jB x ]<l-p + (l-p)- 7> 

relT] 4 



which can be rearranged to p > jq for e < ^ . In other words, we have ^ > ^ n =: k many time units 
t = {ti , . . . , Tfc} in which at least j many packets are crossing an edge me\,...,e n . Let A(r) be the event 
that there is no collision at time t. Then we can bound the probability of having no collision at all, by 
just considering the time units in t: 



Pr 



/\Mt)\ < f[Pr[A(T;)| A(ti),...,A(t ; -_i)] < 

T = l ; = 1 



'1M 

,16 



15 

,16 



It remains to justify the inequality (*). 

Claim. For all j = l,...,k one hasVr[A(.r j) \ A(ti),...,A(t ; _i)] < (yf) n/8 . 

By Pi (t) we denote the random variable that gives the edge that i traverses at time t (in case that 

i is waiting in a node v, let's say that Piir) = {v, v)). Let := {P/(ti) P,(t ; _i)} n{ei,...,e n } be the 

critical edges that packet i has visited at ti,...,t/_i. It suffices to show that Pr[A(Ty) | Ei,...,E n ] < 
(y|)" /16 , i.e. we condition on those edges Let / Q [n] with |/| = -| be the indices of packets that 
cross a critical edge at time tj. We split / into equally sized parts / = I1OI2, i.e. IAI = I/2I = f • Consider 
the critical edges E* := {P/(t ; ) | i e f^} which are chosen by packets in I\. If |£*| < -| then we have a 
collision, so condition on the event that \E* \ = ^. Now for all other packets i e f 2 , the edge Pj(T/) is a 
uniform random choice from {e\,. .., e n }\Ei. Thus we have independently for all i e I2, 

\E*\Ei\ n/8-n/l6 1 

Pr[P;(T/) eE*] = > = — , 

\{ei,...,e n }\Ei\ n 16 



since \E t \ < k= j^. Thus 



Pr 



A{Tj) \ \E*\ = —; E\,...,E n <Pr f\ P^tj) t E* \ \E*\ = - ; E 1 ,...,E n 
8 J I (A 8 



15 
1 16 



nIS 



and the claim follows. 



□ 
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Finally one can check that for £ := 0.000032 and n large enough one has 



'15 1 
,16, 



n 2 /128 



. 2 (<P(£) + o(l))n 2 < l 



and Theorem[6]follows. 

Observe that in our instance, C and D are within a factor of 2 or each other. In contrast, if C » D, 
then there is a schedule of length (1 + o(l)) • C and buffer size 0(|j), see |Sch98, Chapter 6]. 

Acknowledgements. The author is very grateful to Rico Zenklusen for carefully reading a prelimi- 
nary draft. 
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